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ABSTRACT 

This is the first exploration of the galaxy distribution function at redshifts greater than about 0.1. 
Redshifts are based on the North and South GOODS Catalogs. In each catalog we examine clustering 
in the two redshift bands 0.47 < z < 0.8 and 0.9 < z < 1.5. The mean redshifts of the samples 
in these bands are about 0.6 and 1.1. Our main result is that at these redshifts the galaxy spatial 
distribution function /y(iV) has the form predicted by gravitational quasi-equilibrium dynamics for 
cosmological many-body systems. This constrains related processes such as galaxy merging and the 
role of dark matter in the range of these redshifts. 

Subject headings: dark matter-galaxies: clusters: general- Galaxies:statistics-gravitation-large scale 
structure of universe. 



1. INTRODUCTION 

At relatively low redshifts, z < 0.1, several surveys 
have determined the distribution function of galaxies and 
their associated dark matter. These include the Zwicky 
Catalog (Crane & Saslaw 1986) the UGC and ESO Cat- 
alogs (Lahav & Saslaw 1992), the SSRS Catalog in three 
dimensions (Fang & Zou 1994), the Pisces- Persus Su- 
percluster (Saslaw & Haque-Copilah 1998) and most re- 
cently the 2MASS Catalog with about 650,000 galaxies 
(Sivakoff & Saslaw 2005). All these catalogs have yielded 
the galaxy distribution function, fy(N), which is the 
probability that a volume V, or area A, placed randomly 
in space or on the sky contains N galaxies. This is a 
very powerful statistical description of the positions of 
galaxies both in space or on the sky. It includes infor- 
mation about the correlation functions to all orders, as 
well as characterizing voids and near neighbor positions, 
filaments, the average shapes of clusters, and counts in 
cells (Saslaw 2000). 

Under a wide range of conditions, the galaxy distri- 
bution evolves dynamically in quasi-equilibrium and its 
distribution function has the form (Saslaw & Hamilton 
1984; Saslaw & Fang 1996; Ahmad, Saslaw & Bhat 2002) 
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This result can be derived from either the thermody- 
namics or the statistical mechanics of the cosmological 
gravitational many-body system. The expected number 
is 

N — nV (2) 

and 

W 

b = -w < 3 > 

is the ratio of average gravitational correlation energy to 
twice the average kinetic energy of peculiar velocities in 



the system. Equation (1) agrees very closely with obser- 
vational results in the catalogs mentioned above (with no 
free parameters), and also with N-body computer simu- 
lations designed to test the theory on which it is based 
(reviewed in Saslaw 2000). 

The fundamental reason why quasi-equilibrium statis- 
tical mechanics provides a good description of galaxy 
clustering is that the cosmological many-body problem 
is the basic system underlying this clustering and it con- 
tains two different timescales. One is the local dynamical 
timescale on which clustering forms and clusters interact 
in overdense regions. The other is the global timescale 
for macroscopic properties to change. These macroscopic 
properties such as density and pressure are averaged over 
regions which are large enough to contain a statistically 
homogeneous distribution of clusters. Consequently they 
initially change on about the Hubble timescale, but as 
they gradually virialize over larger and larger length- 
scales, they change even more slowly. As the N-body 
simulations mentioned above have shown, this dispar- 
ity of timescales produces quasi-equilibrium evolution of 
clustering. (See Saslaw 2008 for a recent review of this 
and related topics.) 

Of course, there is more to galaxy clustering than 
the gravitational interaction of point masses. Individual 
galaxy dark matter haloes have been incorporated into 
the quasi-equilibrium theory (Ahmad, Saslaw & Bhat 
2002; Leong & Saslaw 2004). Large dark matter haloes 
containing many galaxies have been produced in many 
computer simulations (eg. Guo & White 2008). Unfor- 
tunately these simulations, usually depending on many 
assumptions and parameters, are seldom compared with 
equation (1) or with the observed spatial and peculiar ve- 
locity distribution functions of galaxies. So it is difficult 
to determine their relevance, even though many detailed 
implications of these simulations for galaxies' properties 
can be compared with observations. 
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Although the quasi-equilibrium theory has usually 
been used for galaxies of identical masses, it has also 
been extended to systems containing different masses 
and compared with N-body simulations (Ahmad, Ma- 
lik & Masood 2006). A mass range is a secondary effect 
because most galaxy clustering is produced by the mean 
field of many neighbors rather than by the individual field 
of a nearest neighbor. The mass-morphology-luminosity 
relation for galaxies may show some dependence of the 
distribution function on mass at low rcdshifts, but this 
also appears to be a secondary effect (Lahav & Saslaw 
f992). At present, there is not enough morphological 
data for the GOODS or other high rcdshift catalogs to 
examine this segregation at higher redshifts. 

At larger redshifts around z ~ 0.5 and z ~ 1.1, one 
might expect to begin to find departures from the form 
of equation (1), and observations so far have not been 
used to determine how well this form of fy (N) applies at 
higher rcdshifts. If it does apply, then theoretical analy- 
ses can predict b(z) in simple cases (Saslaw 1986; Saslaw 
& Edgar 2000). However several effects could modify the 
fundamental nature of equation (1). Examples are: 1) 
galaxy mergers which could distort fy(N), 2) the distri- 
bution of dark matter which may have evolved differently 
from that of luminous baryonic galaxies, and 3) the initial 
distribution of galaxies may have been outside the range 
(basin of attraction) which could have evolved into the 
form of equation (1). 

Here we report the first observational determinations 
of fv(N) for redshifts greater than about 0.1, in partic- 
ular for two ranges 0.47 < z < 0.80, and 0.9 < z < 1.5. 
At these redshifts, different catalogs based on different 
magnitude or color cutoffs will contain different num- 
bers of galaxies. However, provided the selection effects 
are homogeneous over the catalog and over the sky, they 
can be normalized out by their value of N. Other more 
subtle known biases can also be accounted for explicitly 
(Lahav & Saslaw 1992). To help minimize these compli- 
cations it is most useful to choose samples large enough 
to provide meaningful statistics, but over a small enough 
rcdshift range to avoid evolutionary smearing effects. At 
low redshifts this is easy to do, but at large redshifts, 
with presently available data, it involves practical com- 
promises. 

In Section 2 , we describe our samples, and in Section 

3 we determine their distribution functions. In Section 

4 wc determine how the value of b depends on the size 
of the cells at these redshifts. Then Section 5 discusses 
some implications of the results. 

2. THE OBSERVED SAMPLES 

Our analysis is mainly based on the GOODS Catalogs 
using both its North and South components separately to 
check on its homogeneity and statistical uncertainty. The 
GOODS South Catalog has four components, the VVDS 
(VIMOS VLT Deep Survey) (Le Fevre, et al. 2004), the 
ESOl (Vanzella, et al. 2005), ES02 (Vanzella, et al. 
2006), and a spectroscopic redshift survey (Ravikumar, 
et al. 2007), all having spectra taken with VIMOS on 
the ESO VLT. These provide 1599, 234, 501, and 961 red- 
shift determinations for each of the catalogs respectively. 
We cross correlate these with a 0.5 arc second search 
radius to eliminate multiple determinations of the same 
rcdshift. This leaves 950 distinct objects in the redshift 
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Fig. 1.— The distribution of (a) 950 galaxies in the GOODS 
South with 0.47 < z < 0.8, (b) 882 galaxies with 0.9 < z < 1.5, 
(c) 685 galaxies in the GOODS North with 0.47 < z < 0.8, (d) 433 
galaxies with 0.9 < z < 1.5. These are all projected in equatorial 
coordinates. 
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redshift 

Fig. 2. — The redshift distribution of galaxies in North (top) 
and South (bottom) catalogs. For each catalog, the galaxies in the 
redshift samples we use are shown as the dark filled histograms. 

range between 0.47 and 0.8 as well as 882 objects in the 
redshift range between 0.9 and 1.5. These ranges were 
selected to include a large enough number of galaxies 
for reasonable statistics in a small enough redshift band 
that evolution would be unlikely to dominate. This is 
clearly a compromise which could be improved in future 
larger but at least equally homogeneous catalogs. Figure 
1 (a and b) shows these two redshift samples projected 
onto the sky. The GOODS North catalog contains a ma- 
jority of objects from the Team Keck Treasury Redshift 
Survey (TKRS) (Wirth, et al. 2004). They measured 
spectroscopic redshifts of 1440 galaxies and AGNs (plus 
96 stars) in this region. There are also 434 other redshifts 
in this region obtained by using the LRIS spectrograph 
(Oke, ct al. 1995) on the twin 10 m telescopes of the 
W. M. Keck Observatory (Cohen, et al. 1996, 2000; 
Cowie, et al. 1996; Steidel, et al. 1996; Lowenthal, et 
al. 1997; Phillips, ct al. 1997; Moustakas, Zepf & Davis 
1997; Cohen 2001; Dawson, et al. 2001) and DEIMOS 
spectrograph (Faber, et al. 2003) on the Keck twin tele- 
scopes (Cowie, et al. 2004). These give a total of 1970 
redshifts, of which 685 are between 0.47 and 0.80 and 433 
are between 0.9 and 1.5, comparable with the numbers 
in these regions in the South catalog. Figure 1 (c and d) 
shows their distribution on the sky. Figure 2 shows the 
redshift distribution for the two ranges in the North and 
South, as well as for all redshifts in both regions. The 
mean redshifts of the four selected samples are 0.61 and 
1.06 in the North, and 0.65 and 1.14 in the South. 

In addition to illustrating the scales of the samples, 
figure (1) shows that they contain a range of voids, fila- 



mentary structures, underdense regions and clusters. It 
also shows the inhomogeneity of the South samples which 
contain four surveys. In particular, the GOODS South 
sample for high redshifts (figure (lb)) has an average 
density which increases systematically with increasing 
right ascension, even in this small area. The GOODS 
North sample, based mainly on one survey, is more ho- 
mogeneous, allowing for greater fluctuations around its 
smaller average density. 

3. THE GALAXY DISTRIBUTION FUNCTIONS AT Z ~ 0.63 

AND Z ~ 1.1 

To obtain the distribution function, we map the galax- 
ies onto a Hammer- Aitoff equal area projection (e.g. Cal- 
abretta & Greisen 2002), divide the area into square cells 
of a given angular size and count the number of galaxies 
in the volume projected into each cell. The samples are 
not yet large enough to examine the areas for complete- 
ness and statistical homogeneity in the usual ways (cf. 
Sivakoff & Saslaw 2005). However the consistency of our 
results below suggests that equation (1) will be a good 
representation of the actual distribution function. 

Figure (3) shows examples of the resulting histograms 
for counts of galaxies in cells in the two redshift ranges of 
both the North and South Catalogs. The redshifts and 
cell sizes are labeled on the histograms. Other examples 
are similar, although as the cells become larger or the 
number of cells decreases, the fluctuations naturally in- 
crease. The solid line is the theoretical curve of equation 
(1) in which N is determined directly from the data. The 
value of b may also be determined directly from the data 
using its relation to the variance of counts in cells having 
volume V projected onto the sky: 

Equation (4) is the variance of counts in cells and it fol- 
lows either directly from equation (1) or from its mo- 
ments, or from its generating function (see Saslaw 2000). 

In addition, we can find b using a least squares fit of 
equation (1) to the histograms. The values of b for fig- 
ure (3) obtained by least squares fitting are 0.17, 0.12, 
0.15, 0.22 from left to right and top to bottom. In the 
Figures we use the values of b from the observed variance 
of counts in cells. Thus the good agreement between ob- 
servations and theory is obtained without the use of any 
free parameters. 

Our results in figure (3), as well as previous observa- 
tions of spatial and velocity distribution functions at low 
redshifts, are a challenge for the usual computer 

simulations to reproduce (without too many free param- 
eters). For example, one of the important consequences 
of figure (3) is the role of galaxy mergers in altering their 
distribution function over time. From the viewpoint of 
the theory behind equation (1), the robustness of /V(iV) 
to mergers can be understood analytically (Yang, Saslaw, 
Leong & Chan 2008, in preparation). 

4. THE OBSERVED DEPENDENCE OF b ON SCALE AND 

REDSHIFT 

The value of b is known to depend on the size of the 
cells for which it is measured. If cells are so small that 
they usually contain only zero or one galaxy, their counts- 
in-cells will have a nearly Poisson distribution for which 
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Fig. 3. — Theoretical curve (continuous line) of fv(N) on the observed histograms for square cells of GOODS North (top) and GOODS 
South (bottom) for two redshift ranges. 
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Properties of distribution functions from counts in cells. 
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Two-point correlation 
function 
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Fig. 4. — The value of b for cells of different angular sizes for the 
two redshift samples in the North and South GOODS catalogs. 

b ~ and {(AN) 2 } ~ N. Greater values of b occur for 
larger departures from Poisson statistics, as equations 
(1) and (4) indicate. Another way of visualizing this is 
by relating b to the two-galaxy correlation function: 



h W 

b= -2K 



2nGm 2 n 
3T 



/« 

Jv 



fi 7 T,r)-r 2 dr (5) 
r 



where K is the kinetic energy and T is the temperature 
given by the peculiar velocity dispersion relative to the 
average Hubble expansion. (For a detailed review see 
Saslaw 2000.) For larger volumes, there is a greater 
contribution of the two-galaxy correlation function £(r) 
to the integral of the gravitational correlation energy W , 
and b increases. On very large scales, the correlations be- 
come small and contribute no further to the integral in 
equation (5). Thus the value of b reaches its asymptotic 
limit on these large scales, provided they are initially 
uncorrelated. At low redshifts, b(r) in the 2MASS cata- 
log can be used with equation (5) to determine the two- 
galaxy correlation function. Comparison of the result 
with the standard direct determination shows excellent 
agreement (Sivakoff & Saslaw 2005). 

The cell size d(Mpc) is related to its angular size 
^(radians) at a redshift z by (Coles & Lucchin 2002) 
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where we take the Hubble constant Hq = 70 km s 1 
Mpc~ l and q = 0.5. 

Figure (4) shows b(9) for cells of different angular sizes 
(in arc seconds) for the two redshift samples in the North 
and South GOODS catalogs. Table (1) gives more de- 
tailed information on these results. This information is 
useful for understanding the physical scales of the cells 
and the numbers of galaxies and cells being analyzed. It 
also gives the numbers for figure (3) which are used to cal- 
culate the two-galaxy correlation function (in figure (5)) 
from equation (5). Notice that for the same angular size, 
the physical sizes of cells in the South is slightly greater 
than in the North, because of the slightly greater average 
redshift of the Southern samples. We have also exam- 
ined the effect of the strong redshift peak in the South 
at z = 0.74 by reducing the upper redshift to z = 0.73 
which eliminates the peak in that nearer sample. The 
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result is to decrease the values of b for the same size cells 
by between 0.01 for the smallest cells and 0.1 for cells of 
about 220 arc sec in length, usually an effect of about 
15 - 20%. 

Figure (5) compares b(9) with values based on the 
two-galaxy angular correlation function, W(s). Lahav & 
Saslaw 1992 derived b(8) for square cells of size w = 9x9 
deg 2 

b{9) = l-(l + Nsl- 1 C 1 9 1 -^y 1/2 (7) 

where W(s) = (s/so) 1 7 , N is the expected number of 
galaxies in a cell, and C 7 is a coefficient to be evaluated 
numerically. Lahav & Saslaw 1992 calculated C 7 's cor- 
responding to four different values of 7 = 1, 1.68, 1.8, 
and 2 to be 1, 1.87, 2.25, and 2.97 respectively. We used 
the lower redshift sample in the North (which seems the 
most homogeneous) and then fixed C 7 to each of these 
four values to fit equation (7) to the data and find 7 and 
s - As table (2) shows by changing C 7 to its four nu- 
merically calculated values the fitted values of 7 do not 
change but so decreases. 

To determine whether cells near the edges of each re- 
gion could have significant effects on the results, we omit- 
ted them and found that they did not change the trend 
of the values of b in the two redshift intervals. 

We also tested the effects of different cross-correlation 
radii among the different catalogs which constitute the 
GOODS North and South catalogs. Increasing this cross- 
correlation radius from 0.5 to 0.75, 1.0, and 1.5 arcsec- 
onds gave 3009, 3005, 3003 and 2994 independent objects 
respectively. So cross-correlation radii in this range have 
negligible effects. 

In the North Catalog, the majority of the data (1536 
secure redshifts) come from the TKRS redshift survey, 
and the other 434 are from other redshift surveys. To 
examine possible effects of differences among these cat- 
alogs, we repeated all the analysis using only the TKRS 
survey. This gave small changes in the values of b but 
did not affect the trend of values of b between the two 
redshift intervals. 

As the total number of cells, n, in Table (1) decreases 
below about 150 (conservatively) the fv(N) histograms 
become less smooth and their corresponding values of b 
become less reliable. The only ways to explore these 
regimes more accurately are with large homogeneous 
samples covering greater areas, and with a larger number 
of such samples. 

5. DISCUSSION 

Our most striking result is that at redshifts up to about 
z = 1.5 the form of the spatial galaxy distribution func- 
tion is remarkably similar to its form at the present time. 
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Fig. 5. — The points show the b values for different cell 
sizes in North with 0.47 < z < 0.8. The curve is the pre- 
dicted b(9) fitted on these points according to b{9) = 1 — 

(l + JVs^ 1 C 7 6' 1 -t) 1/2 (Lahav & Saslaw 1992) with s = 
0.0017° and 7 = 1.68. 

Both these forms were predicted by the gravitational sta- 
tistical mechanics and thermodynamics of the cosmolog- 
ical many-body system. This indicates that although 
merging and dark matter can be important for the evo- 
lution of individual galaxies, they do not dominate the 
forms of the large scale galaxy distribution. The reasons 
for this lack of dominance may be able to place important 
constraints on merging and dark matter. 

To obtain these constraints, we need to determine how 
b(r, z) depends on the scale r, and how it evolves with 
rcdshift. The theory predicts both these dependencies 
(e.g. Saslaw 2000; Ahmad, Saslaw & Bhat 2002). In par- 
ticular b should decrease with increasing redshift. Fig- 
ure (4) and Table (1) show that this decrease holds in 



the North GOODS catalog, but not in the South. The 
simplest explanation for this difference may be that the 
South region is a compilation of four distinct catalogs, 
none of which dominates. This may encourage more in- 
homogeneity than in the North region which is domi- 
nated by one catalog. Indeed Vanzella, et al. (2005, 
2006) and Ravikumar, et al. (2007) have clearly noted 
the inhomogeneity of structure in the South region. This 
may be a region of excess clustering which would produce 
an unusually large apparent local value of b. (We recall 
that the usual value of b represents an ensemble average 
over many regions which cover a representative range of 
clustering.) On the other hand, the decrease of b with 
z in the North region behaves qualitatively as expected. 
Whether it agrees quantitatively with the gravitational 
many-body theory is being investigated elsewhere (Yang, 
Saslaw, Leong & Chan (2008, in preparation)). 

When b(r) can be determined more accurately, using a 
larger number of larger homogeneous catalogs, it will also 
provide information on the evolution of the two-galaxy 
correlation function. (See Sivakoff & Saslaw 2005 for an 
example of the technique to accomplish this.) This, in 
turn, can then be related to many computer simulations 
of galaxy clustering which measure the two-galaxy cor- 
relations under a variety of conditions. 
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